Kirkpatrick et al.

mentions 1 type Person feed RSS

// recent coverage 1 mentions

12:12

2026-06-23

lesswrong.com

large-language-models

Catastrophic Forgetting and Safety Erosion Are Driven by the Same Mechanism and Should Be Monitored by the Same Tools

Researchers have found that catastrophic forgetting and safety erosion in large language models are driven by the same gradient-interference mechanism, suggesting that tools used to monitor and mitiga…

// co-occurs with top 7 entities

EWC 1 SafeGrad 1 GEM 1 Yi et al. 1 Lopez-Paz & Ranzato 1 Zhang et al. 1 Qi et al. 1